Klasifikasi Berita Menggunakan Metode K-Nearest Neighbor
نویسندگان
چکیده
Abstrak - Meningkatnya minat masyarakat dalam mengakses berita, khususnya berita online, menuntut redaktur dan situs portal untuk memberikan liputan yang berkualitas. Selain itu, klasifikas ada masih tergolong umum dapat menjadi kendala dialami pembaca. jika pembaca ingin melihat kategori lebih spesifik, mereka harus menyaring tersebut secara manual. Hal ini juga terjadi di bidang sosial Badan Pusat Statistik Provinsi Riau kesulitan mencari tentang Riau. Oleh karena proses klasifikasi menggunakan metode k-nearest neighbor hal krusial dilakukan. Jumlah digunakan penelitian berjumlah 510 data dengan tiga yaitu demokrasi, kemiskinan, ketenagakerjaan. Proses meliputi: pengumpulan data, pelabelan manual, preprocessing teks, pembobotan kata, memakai neighbor. cosinus similarity meningkatkan nilai akurasi. Nilai akurasi tertinggi diperoleh pada adalah 87% k = 3 distribusi uji 20% latih dari 80%. Dari diambil kesimpulan bahwa K-Nearest Neighbor bekerja baik berita.Kata kunci: Statistik, Berita, Cosine Similarity, Klasifikasi, Abstract The increasing of public interest in accessing news, especially online requires editors and news sites to provide quality coverage news. In addition, the grouping that still classified as a general can be an obstacle experienced by readers. if reader wants see more specific category they must filter manually. This is also happened social sector Riau, which has trouble when finding about Province. Therefore, classification process using method crucial thing do. number stories used this study amounted with three categories, democracy, poverty, employment. includes: collection, manual labeling, text preprocessing, word weighting, method. Besides that, cosine increase accuracy value. highest values obtained were distribution test training From research, it concluded works well process.Keywords: Classification, Neighbor, News
منابع مشابه
Drought Monitoring and Prediction using K-Nearest Neighbor Algorithm
Drought is a climate phenomenon which might occur in any climate condition and all regions on the earth. Effective drought management depends on the application of appropriate drought indices. Drought indices are variables which are used to detect and characterize drought conditions. In this study, it was tried to predict drought occurrence, based on the standard precipitation index (SPI), usin...
متن کاملFast Approximate Nearest-Neighbor Search with k-Nearest Neighbor Graph
We introduce a new nearest neighbor search algorithm. The algorithm builds a nearest neighbor graph in an offline phase and when queried with a new point, performs hill-climbing starting from a randomly sampled node of the graph. We provide theoretical guarantees for the accuracy and the computational complexity and empirically show the effectiveness of this algorithm.
متن کاملUnsupervised K-Nearest Neighbor Regression
In many scientific disciplines structures in highdimensional data have to be found, e.g., in stellar spectra, in genome data, or in face recognition tasks. In this work we present a novel approach to non-linear dimensionality reduction. It is based on fitting K-nearest neighbor regression to the unsupervised regression framework for learning of low-dimensional manifolds. Similar to related appr...
متن کاملNeighbor-weighted K-nearest neighbor for unbalanced text corpus
Text categorization or classification is the automated assigning of text documents to pre-defined classes based on their contents. Many of classification algorithms usually assume that the training examples are evenly distributed among different classes. However, unbalanced data sets often appear in many practical applications. In order to deal with uneven text sets, we propose the neighbor-wei...
متن کاملEvolving edited k-Nearest Neighbor Classifiers
The k-nearest neighbor method is a classifier based on the evaluation of the distances to each pattern in the training set. The edited version of this method consists of the application of this classifier with a subset of the complete training set in which some of the training patterns are excluded, in order to reduce the classification error rate. In recent works, genetic algorithms have been ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Jurnal nasional komputasi dan teknologi informasi
سال: 2022
ISSN: ['2620-8342', '2621-3052']
DOI: https://doi.org/10.32672/jnkti.v5i2.4192